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ABSTRACT 

Using public data from the NEWFIRM Medium-Band Survey (NMBS) and the Cosmic Assembly Near-Infrared 
Deep Extragalactic Legacy Survey (CANDELS), we investigate the population of massive galaxies at z > 3. 

The main aim of this work is to identify the potential progenitors of z ~ 2 compact, massive, quiescent galaxies 
(CMQGs), furthering our understanding of the onset and evolution of massive galaxies. Our work is enabled by 
high-resolution images from CANDELS data and accurate photometric redshifts, stellar masses, and star formation 
rates (SFRs) from 37-band NMBS photometry. The total number of massive galaxies at z > 3 is consistent with 
the number of massive, quiescent galaxies (MQGs) at z ~ 2, implying that the SFRs for all of these galaxies must 
be much lower by z ~ 2. We discover four CMQGs at z > 3, pushing back the time for which such galaxies have 
been observed. However, the volume density for these galaxies is significantly less than that of galaxies at z < 2 
with similar masses, SFRs, and sizes, implying that additional CMQGs must be created in the intervening ~1 Gyr 
between z = 3 and z = 2. We find five star- forming galaxies at z ~ 3 that are compact (R e <1.4 kpc) and have 
stellar mass M* > 10 10,6 M 0 ; these galaxies are likely to become members of the massive, quiescent, compact 
galaxy population at z ~ 2. We evolve the stellar masses and SFRs of each individual z > 3 galaxy adopting five 
different star formation histories (SFHs) and studying the resulting population of massive galaxies at z = 2.3. We 
find that declining or truncated SFHs are necessary to match the observed number density of MQGs at z ~ 2, 
whereas a constant delayed-exponential SFH would result in a number density significantly smaller than observed. 

All of our assumed SFHs imply number densities of CMQGs at z ~ 2 that are consistent with the observed number 
density. Better agreement with the observed number density of CMQGs at z ~ 2 is obtained if merging is included 
in the analysis and better still if star formation quenching is assumed to shortly follow the merging event, as implied 
by recent models of the formation of MQGs. 
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1. INTRODUCTION 

The population of galaxies in the Local Universe presents a 
clear bi-modality, as evidenced by color-magnitude diagrams 
(e.g., Baldry et al. 2004; Kauffmann et al. 2004), with galaxies 
either living on the red sequence or in the blue cloud. This bi- 
modality is further supported by tight correlations between the 
main physical properties of each class (e.g., Tully & Fisher 1977 ; 
Kormendy & Djorgovski 1989). Galaxies on the red sequence 
typically are massive and quiescent (i.e., with low or no ongoing 
star formation) with early-type morphologies, while galaxies in 
the blue cloud are less massive, with higher star formation rates 
(SFRs) and have spiral or irregular morphologies. 

The physical mechanisms driving the onset of the observed 
bi-modality are one of the main open issues in the study of 
galaxy formation. At high redshift the observational picture is 
complicated by the lack of a common definition for massive, 
quiescent galaxies (MQGs) in the literature (see discussion in 
Saracco et al. 2012). 

Samples selected according to the spectral energy distri- 
bution (SED) show that massive (i.e., stellar mass M * > 
10 11 Mq) and quiescent (i.e., specific star formation rate 
sSFR < 10“ 11 -10“ 10 yr -1 ) galaxies were already in place at 
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z ~ 2 (Franx et al. 2003; Daddi et al. 2005; Kriek et al. 2006; 
Cimatti et al. 2008). Their number density has grown by almost 
a factor of 10 since z = 3 (Labbe et al. 2005; Arnouts et al. 
2007; Fontana et al. 2009; Taylor et al. 2009; Ilbert et al. 2010; 
Cassata et al. 2011; Brammer et al. 2011; Dominguez Sanchez 
et al. 2011; Bell et al. 2012), although most of the evolution 
occurred at z > 1 (e.g., Pozzetti et al. 2007). 

In addition to rapid number density evolution, the sizes of 
MQGs have evolved dramatically from high redshift to the 
present day. Most of MQGs at z > 1.5 are also compact 
systems: the observed effective radii are generally a factor of 
~3-5 smaller at z ~ 2.5 (Daddi et al. 2005; Longhetti et al. 
2007; Toft et al. 2007; Trujillo et al. 2007; van Dokkum et al. 
2008; Cimatti et al. 2008; Saracco et al. 2009; van der Wei 
et al. 2008; Bezanson et al. 2009; Szomoru et al. 2012) than at 
z = 0, where such compact galaxies are almost entirely absent 
(Trujillo et al. 2009; Taylor et al. 2010). 

There have been multiple attempts to explain the evolution 
of these objects and to place them into a broader cosmological 
context. In the so-called monolithic collapse scenario, MQGs 
would have assembled almost all of their stellar mass at 
high redshift, followed by a passive evolution of the stellar 
population. This class of models, however, foresees little or no 
evolution in size, in contrast to observations. More recent models 
describe the formation of MQGs as a two- stage process: gas-rich 
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merger events generate compact, massive spheroids at z > 3, 
while minor, dry (i.e., without formation of new stars) mergers 
at later cosmic times would increase the size, while responsible 
for only a small increase in stellar mass (Naab et al. 2007, 
2009; Wuyts et al. 2010; Oser et al. 2012). Indeed, Bezanson 
et al. (2009) showed that the stellar mass density of the inner 
~ 1 kpc of z ~ 2 galaxies does not substantially differ from that 
in present-day MQGs, suggesting that the growth during the 
second phase progresses from the inner region rapidly toward 
the external parts (the so-called inside-out growth; see, e.g., van 
Dokkum et al. 2010). 

However, the picture is still far from being completely clear: 
detailed comparisons with ACDM models of dry merging show 
that some of the models predict descendants of z > 2 too 
compact compared to the observed local MQGs (Cimatti et al. 
2012). Recent studies have also revealed the existence of a large 
number of MQGs at z ~ 1.5 with sizes similar to those of 
the local galaxies with comparable mass (Saracco et al. 2009, 
2010; Mancini et al. 2010). If the population of MQGs at 
z > 2.5 includes only compact systems, this would imply a size 
evolution timescale of ~ 1 Gyr, challenging the current models 
of formation of local MQGs (Saracco et al. 2010). 

The main goal of this work is to identify, within an obser- 
vational framework, the population of galaxies at z > 3 which 
could give rise to the population of compact MQGs (CMQGs) 
observed at z = 2.3. We combine data from two public sur- 
veys, the NEWFIRM Medium-Band Survey (NMBS; Whitaker 
et al. 2011) and the Cosmic Assembly Near-infrared Deep Ex- 
tragalactic Legacy Survey (CANDELS; Grogin et al. 2011; 
Koekemoer et al. 2011). The NMBS provides accurate mea- 
surements of photometric redshifts (z p hotX stellar masses (AT*), 
and SFR, while robust morphologies and sizes were measured 
from the high-resolution imaging from CANDELS. 

In Section 2, we describe the data sets used in our work, the 
measurements of the sizes, and the completeness in stellar mass. 
The results are presented in Section 3. We discuss our results in 
Section 4, and summarize them in Section 5. 

Throughout this work, we use a concordance cosmology with 
Hq = 70 Km s -1 Mpc -1 , Cl m = 0.3, and = 0.7. All 
magnitudes are referred to the AB system. 

2. DATA 

We combined the photometric depth and the accurate mea- 
surements of photometric redshifts offered by the NMBS (van 
Dokkum et al. 2009; Whitaker et al. 2011) with the high- 
resolution HST/ WFC3 deep NIR imaging from the CANDELS 
(Grogin et al. 201 1 ; Koekemoer et al. 2011). These two surveys 
will be briefly described in the next two sections. 

2.7. NMBS-COSMOS 

The NMBS covers two fields: a 27? 6 x 27? 6 pointing within 
the COSMOS field (Scoville et al. 2007) and a second pointing, 
of the same size, overlapping with part of the All-Wavelength 
Extended Groth Strip International Survey (AEGIS) strip (Davis 
et al. 2007). Only data from the COSMOS field were used, since 
the overlap between NMBS-AEGIS and CANDELS is minimal; 
using both fields the average improvement in cosmic variance 
would have been just 5%. 

NMBS uses five medium-bandwidth filters in the NIR wave- 
length range 1-1.8 /xm. The bluest filter is similar to the T band, 
the canonical J and H bands are split into two filters each. Such 
configuration pinpoints the location of the redshifted B aimer/ 


4000 A breaks in 1.5 < z < 3.5 galaxies (van Dokkum et al. 
2009) and enables photometric redshift measurements with an 
accuracy of cr z /(l + z) ~ 2% for objects in the redshift range 
1.5 < z < 3.5 (Whitaker et al. 2011). 

The full public catalog for the COSMOS field (Whitaker 
et al. 201 1) 6 provides UV-to-24 /xm photometry for 31,306 
K-selected sources based on the de-blended version, along 
with accurate photometric redshifts, rest-frame luminosities, 
SFRs, and stellar mass measurements. The redshifts and stel- 
lar masses were computed using 37 filters from the COSMOS 
fields, combining the NIR medium-bandwidth data with ex- 
isting UV ( Galaxy Evolution Explorer ), visible and NIR 
(Canada-France-Hawaii Telescope and Subaru Telescope), and 
mid-IR (Spitzer/ IRAC) imaging (Whitaker et al. 2011). 

The catalog is complete at the 75% detection level for 
magnitudes brighter than K s = 23.1AB (Whitaker et al. 
2011); we selected objects brighter than K s = 23.4 mag, 
corresponding to a detection completeness of 50%. 

2.2. CANDELS 

CANDELS is a 902-orbit Hubble Space Telescope (HST) 
Multi-Cycle Treasury program aimed at probing the evolution 
of galaxies and black holes from z ~ 1.5 to 8 and at detecting 
and studying Type la supernovae at z > 1.5 in order to better 
constrain the nature of dark energy (Koekemoer et al. 201 1). 

The wide portion of the survey covers a total of ~800 arcmin 2 
down to H ~ 26.5 mag, spread over five fields: extended regions 
around the two GOODS fields, the Extended Groth Strip (EGS; 
Davis et al. 2007), COSMOS (Scoville et al. 2007), and the 
UKIDSS Ultra-Deep Survey (UDS; Lawrence et al. 2007). 

We used the F160W as the filter for the measurement of the 
morphological parameters as it is the reddest band available 
with high-resolution imaging. At redshift z ~ 3, the F160W 
filter corresponds to the rest-frame B band. 

2.3. Sample Selection 

The FWHM of the point-spread function (PSF) on the CAN- 
DELS F160W images is FWHM ~ 0? 17-0? 19 (Koekemoer 
et al. 201 1), a factor of ~6 better than the K-band PSF in NMBS. 
This fact, together with the higher photometric depth compared 
to the NMBS survey, translates into a higher surface brightness 
sensitivity than NMBS. 

From the full NMBS catalog we extracted those non-stellar 
objects and objects with a good quality flag (star^flag = 0 and 
use = 1). We selected galaxies compatible with being at z > 3 
at the lcr level (i.e., u_68 > 3.). In this way, we selected a total 
of 613 galaxies. 

The overlap of the CANDELS COSMOS image and the 
NMBS COSMOS field amounts to 192 arcmin 2 , ~20% of the 
original NMBS field. This reduces the sample of z > 3 galaxies 
available to the measurement of the morphological parameters 
to 133 sources. 

The availability of CANDELS data allowed us to accurately 
measure the morphological parameters of our z > 3 galaxies 
and to identify potentially blended objects, which appear as 
single sources in the NMBS catalog. In Figure 1, we show an 
example of the importance of high-resolution imaging, which 
allowed us to identify those cases in which blended and/or 
very close objects could affect the measurement of the SED, 
with consequently unreliable photometric redshifts and stellar 

6 Catalogs can be downloaded from 
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Figure 1. Example of the need for high-resolution images such as those from CANDELS in identifying blended objects in lower resolution images. From left to right: 
(a) tile from the F160W CANDELS image; (b) segmentation map from NMBS (gray scale) and from the F160W CANDELS frame (colored spots); the purple + 
indicates the center of the NMBS source, which is the center of the frame; the axis values are in arcseconds relative to the center of the tile; (c) residual image from 
GALFIT after fitting all sources simultaneously; the pixel scale for tiles (a), (b), and (c) is the same, (d) Observed SED from NMBS (black points with error bars) and 
SED fit from EAZY (solid red line). The blended objects like the one here plotted were excluded from the analysis. 

(A color version of this figure is available in the online journal.) 
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Figure 2. Observed SED from NMBS (black points), EAZY best- fitting SED 
(solid red curve) for a possible AGN. A color cutout created from the F814W, 
F125W, and F160W CANDELS filters is shown in the top left corner, while 
a cutout from CANDELS F160W image is presented at the bottom left. The 
angular size for both cutouts is 377 x 377. Objects like the one plotted here were 
excluded from the sample. 

(A color version of this figure is available in the online journal.) 

population properties (e.g., stellar masses, SFRs). For these 
reasons, blended objects were excluded from our final analysis. 
We also visually inspected the 133 SEDs of the z > 3 galaxies 
and excluded from the sample those showing the presence of a 
possible active galactic nucleus (AGN; an example of which is 
presented in Figure 2). The fraction of such objects in the whole 
sample of 133 galaxies summed to 4%. 

The final catalog includes 110 galaxies. The distribution of 
M* with Zphot for the final sample is presented in Figure 3. The 
median redshift and redshift uncertainty for the full sample are 
Ztot = 3.2 and 0.1, respectively; similarly, for the M* > 10 11 M© 
subsample, we have zm = 3.2 ± 0.3. The plot also shows that 
out of the 10 objects with redshift compatible with z ~ 2.3, 
none of them has a stellar mass M * > 10 11 M©, increasing our 
confidence in the adopted redshift selection criteria. 

2.4. Size Measurement 

The luminosity profile of each galaxy was fit by a single 
Sersic (1968) profile, using the GALFIT (Peng et al. 2002, 2010) 
program. This code fits two-dimensional analytic functions 
directly to images after convolving the profile with a PSF. All 
size measurements were done on the F160W CANDELS image. 



Redshift 

Figure 3. Stellar mass as a function of redshift for the 110 galaxies constituting 
our final sample (blue filled squares), after cleaning for blended objects (gray 
filled circles), and possible AGNs (red filled diamonds). The error bars in 
redshift encompass the 68% confidence level, as estimated by EAZY. The 
median redshift of M* > 10 11 Mq galaxies is 3.2, and it is marked by the 
vertical dashed line. All M* > 10 11 Mq galaxies, selected to possibly lie at 
z > 3, have a redshift z > 2.5 at a 68% confidence level. This gives us 
confidence that our selection is not including z ~ 2 galaxies with very broad 
redshift probability distributions. 

(A color version of this figure is available in the online journal.) 

A zero-order set of morphological parameters was obtained 
via SExtractor (Bertin & Arnouts 1996) on the F160W image. 
In order to make the measurements of the morphological 
parameters with GALFIT as robust and reliable as possible, 
two fundamental steps were taken into account. The first is the 
construction of the PSF, while the second is the evaluation of 
the sky background level. 

The PSF was constructed with IRAF/Daophot (Stetson 
1987), from a set of unsaturated, bright, and isolated stars, and 
using a Gaussian analytic function plus a look-up table built by 
the IRAF psf task from the residuals of the function fitting. This 
allowed us to take into account the anisotropies of the brightness 
profile, especially those in the wings of the PSF. 

The SExtractor sky measurement is based on thresholding: 
the sky is measured by determining when the gradient of the 
flux flattens out enough compared to the background noise. 
In particular, this means that SExtractor will generally over- 
predict the sky with the effect of suppressing the Sersic in- 
dex, the effective radius, and luminosity. In our analysis, the 
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Figure 4. Specific star formation rate as a function of R e . Associated errors on 
R e are from GALFIT. Points are color-coded according to their stellar mass. 
The sample defined by log(M*/M 0 ) >11 and marked in the plot by the large 
red filled squares is complete in stellar mass at the 70% level. As such, this is 
the sample we adopt to draw our conclusions. The vertical gray region marks 
the limit to which we can resolve objects, determined by fitting stars with a 
Sersic (1968) profile. We note that, given the existence of a size-mass relation, 
the vertical line at R e = 2 kpc marking the separation between compact and 
extended galaxies only applies to log (M*/Af Q ) >11 galaxies (see Section 3.2 
for a more complete discussion). The inset shows the cumulative fraction of 
galaxies according to their effective radius R e for the full sample (solid black 
line), for the massive (log(M*/M 0 ) >11) galaxies (dotted red line), and for the 
compact ( R e < 2 kpc), massive, quiescent (log(sSFR/yr _1 ) < —11) sample 
(dashed purple line). A vertical gray line marks our limit in R e for the compact 
galaxies. Approximately 70% of the galaxies in our (flux-limited) sample have 
R e < 2 kpc; 4 out of the 10 massive galaxies are also compact. No extended 
quiescent galaxies are present, although this could be the result of a surface 
brightness effect. Star-forming galaxies show no evident correlation between 
R e and sSFR. 

(A color version of this figure is available in the online journal.) 


background level was estimated from its median value in 30 non- 
overlapping boxes 50 pixels (3") wide, distributed across a re- 
gion of 400 pixels (24") around the central object, and free from 
any other source, as probed by the SExtractor segmentation map. 
The region size is wide enough to grant robust statistics and at 
the same time it is small enough to mitigate possible gradients 
in the sky background. The boxes were kept to an additional 
distance of 076 from the segmentation map, increasing the con- 
fidence in excluding contamination from the outskirts of objects. 

All objects in a box of 150 pixels (9") around each galaxy 
were simultaneously fit, in order to take into account possi- 
ble contamination of the brightness profile from neighboring 
objects, which could bias the measured size and Sersic index. 

The effective radii were finally circularized and converted to 
physical units using the adopted cosmology. 

The resulting distribution of R e as a function of the sSFR for 
the galaxies of our sample is presented in Figure 4. 

2.5. Stellar Mass Completeness 

Consistent statistical measurements of intrinsic physical 
quantities rely on the accurate characterization of the selection 
effects, i.e., on the completeness of the sample. 

The galaxy stellar mass completeness for a flux-limited 
sample not only directly depends on the limiting flux itself but 
also on the mass-to-light ratio (. M/L ) of each galaxy. All else 
being the same, galaxies with lower M/L values will be probed 
to lower mass limits; similarly, it is possible to probe only the 
higher stellar mass region for those objects with higher M/L 
ratios (see, e.g., Marchesini et al. 2009). 
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Figure 5. Stellar mass as a function of the rest-frame V-band mass-to-light 
ratio for all the NMBS galaxies with redshift compatible with being at 
z > 3. Points are color-coded according to their stellar mass. Objects covered 
by the CANDELS F160W frame which allowed for size measurement are 
marked by larger symbols, with the symbol size proportional to R e . The 50% 
completeness level, computed adopting the K-bmd detection completeness 
curve from Whitaker et al. (201 1), is marked by the yellow dashed curve, while 
the 80% completeness level by the cyan dash-dotted curve. The vertical arrow 
marks the M/L of a single stellar population passively evolved to z = 3.2, 
smaller than the maximum value we observe for the massive population, 
increasing our confidence on our completeness analysis. The completeness of 
the M* > 10 11 Mq galaxies is >70%. 

(A color version of this figure is available in the online journal.) 
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The above effect makes the measurement of the stellar mass 
completeness a non trivial task; common approaches involve 
either Single Stellar Population (SSP) modeling (Dickinson 
et al. 2003) and/or comparing the galaxy populations to existing 
deeper data (Marchesini et al. 2010). 

The average PSF on NMBS frames is ~ 1 7 1 , corresponding 
to an R e of ~07 6, or ~5 kpc at z = 3.2. In particular, this means 
that objects with R e < 5 kpc are essentially point sources on the 
NMBS frames. The inset in Figure 4 shows that the totality of 
the galaxies in our sample has R e < 5 kpc. We therefore adopt 
for each galaxy the K - band completeness curve measured for 
point sources for the NMBS survey and presented in Whitaker 
et al. (2011). 

The completeness of the z > 3 sample as a function of stellar 
mass was analyzed for different ranges in M/L ratio. Objects 
with lo g(M*/M 0 ) > 11 are complete at the ~70% level. A 
graphical representation is plotted in Figure 5, which shows the 
1 og(M*/M 0 ) versus M/L ratio diagram for all NMBS galaxies 
at z > 3, as well as the 50% and 80% completeness curves of 
the stellar mass as a function of M/L ratio derived from the 
K - band detection completeness curve from Whitaker et al. 
(2011). Figure 5 shows that for galaxies with 1 og(M*/M©) > 
11, the completeness is >70%. Also plotted is the M/L at 
z = 3.2 of a passively evolving single stellar population formed 
at Zform = 20. Its value, equal to 0.78 M 0 /T 0 ,v, well below the 
maximum value we observe for the massive sample, provides 
additional confidence in the adopted completeness limits in 
stellar mass. We furthermore note that the completeness in 
stellar mass as a function of redshift is fairly flat over the 
redshift range targeted in our work, changing by, e.g., ~0.1 dex 
from z = 4 to z = 3 for a single stellar population formed at 
£ form = 10. In our analysis we then selected only those galaxies 
with lo g(M*/M 0 ) >11 (for a total of 10 objects) for which 
small completeness corrections are required; specifically, the 
correction for incompleteness ranged from 1.05 to 1.5. 
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In subsequent sections, we will evolve our z ~ 3 sample 
to z ~ 2 to assess its correspondence to the z ~ 2 observed 
population. In principle, we should also estimate completeness 
levels in stellar mass for this evolved population. This could 
be done in a way similar as that implemented for the z > 3 
sample. In particular, this means that for each object the final 
completeness correction for the z — 2 sample would be the 
product between the completeness correction at z — 3 and that 
at z — 2. However, as shown by Brammer et al. (2011), the 
stellar mass completeness at z — 2 is 95% or better for galaxies 
with stellar mass log(M*) >11, which coincides with our stellar 
mass selection limit. This implies that no further correction is 
needed for the massive z — 2 sample. 

2.6. Cosmic Variance 

Given the small region of overlap between the NMBS- 
COSMOS and CANDELS-COSMOS fields, the statistical un- 
certainties associated with fluctuations of the large-scale density 
(i.e., the cosmic variance) play a non-negligible role. We mea- 
sured such effects following the recipe by Moster et al. (201 1). 
A halo distribution model is used to relate the stellar mass to the 
dark matter halo as a function of redshift; the galaxy bias is then 
estimated via dissipationless A-body simulations. The cosmic 
variance is first computed on dark matter halos and then con- 
verted to galaxy cosmic variance by applying the galaxy bias. 
The average relative error due to cosmic variance for the z > 3 
massive sample is 35%. This value was added in quadrature to 
the Poisson errors in the number densities we computed. 

2. 7. The Sample 

Our primary sample is composed by those galaxies brighter 
than Ks = 23.4AB, whose redshift is compatible with z > 3, 
with stellar mass M* > 10 11 M 0 , and which lie on the 
CANDELS-COSMOS frame. This selection yields 10 galaxies. 
Given that their associated completeness is ~70%, this is the 
only sample that allows us to perform a quantitatively robust 
analysis. Specifically, all number densities quoted in this work 
are based on the above sample. Their SEDs are shown in 
Figure 6, together with the F160W and color cutout. 

3. RESULTS 

3.1. Massive z > 3 Galaxies 

The distribution of the specific star formation rate (sSFR = 
SFR /A/*) as a function of the effective radius R e for the 
3 < z < 4 sample is shown in Figure 4. Points are color- 
coded according to their stellar mass. The effective radius from 
a Sersic profile fit to the point-source objects is 0711 =b 0701; 
its projection to 3 < z < 4 corresponds to ~0.8 kpc and is 
marked by the solid gray region, identifying a limit for the 
PSF-convolved Sersic profile. This marks an empirically deter- 
mined limit below which we cannot robustly determine the size. 
Given our stellar mass completeness measurement, galaxies sat- 
isfying the 70% completeness limit coincide with those whose 
stellar mass M* > 10 11 M 0 , identified by the red squares. 

In the following, if not otherwise specified, we refer to 
massive galaxies as those with stellar mass M* > 10 11 Mq\ we 
define compact galaxies as those with R e < 2 kpc and quiescent 
galaxies as those whose sSFR < 10“ 11 yr 1 . Our definition of 
quiescent galaxies is based on the work of Kriek et al. (2006), 
where a population of MQGs is spectroscopically identified at 
z ~ 2.3. In their sample, the average sSFR is 4 x 1 0 - 12 yr -i ? 


with all galaxies having log(sSFR/yr _1 ) < —11. For galaxies 
of lo g(M*/M 0 ) = 11, this corresponds to SFR < 1 M 0 yr -1 . 
While galaxies with sSFR > 10“ 10 yr -1 can be reasonably 
considered star forming, those with intermediate sSFR (— 1 1 < 
log(sSFR/yr _1 ) < —10) are galaxies with suppressed star 
formation activities (relative to their stellar masses) with respect 
to the star- forming sequence observed at z ~ 2 (e.g., Whitaker 
et al. 2012; Szomoru et al. 2012), although not necessarily 
quiescent. We therefore adopt a more conservative definition 
of quiescent galaxy (i.e., sSFR < 10“ 11 yr -1 ) in place of 
sSFR < 10 10 yr 1 sometimes used in the literature. We also 
checked Spitzer/ MIPS 24 /xm data relative to each massive 
galaxies. All but one star-forming galaxies are detected at more 
than 3a. No significant MIPS detection is found for two of 
the four MQGs, consistently with their quiescent nature from 
SED modeling. However, two quiescent massive galaxies have 
significant MIPS fluxes (~80 and ~140 /xJy), which would 
imply large SFRs (700-900 M© yr -1 ; Whitaker et al. 2012) if 
the 24 /xm emission were associated with dust-enshrouded star 
formation. We note, however, that at z > 3, the MIPS 24 /xm 
band samples rest-frame wavelengths shorter than 6 /xm, i.e., 
emission from hot dust. As such, the conversion from mid-IR 
fluxes to SFRs is very uncertain, especially at the high redshifts 
targeted in our work. Moreover, additional contamination due 
to the emission from the dusty torus of an AGN can potentially 
contaminate the MIR fluxes probed by the MIPS 24 /xm band. 
For these reasons, we will use the SFRs derived from the 
SED modeling throughout the paper, noting that half of the 
subsample of MQGs could actually be highly obscured, star- 
forming galaxies. Observations in the far-IR (e.g., ALMA) are 
needed to robustly quantify the level of obscured star formation 
and to confirm the quiescent nature of these galaxies. 

The data show that at z > 3, M* > 10 11 M© galaxies that are 
quiescent tend to be compact, while those that are star forming 
are more extended, similar to the distribution found at z ~ 2. 
Most of the galaxies (~70%; see the inset in Figure 4) are char- 
acterized by R e < 2 kpc, with a median of 1 .5 kpc. However, we 
caution that these values refer to the full sample that is not com- 
plete in stellar mass. The average size for the MQGs in the stellar 
mass complete sample is 1.2 kpc (0.6 kpc if the two galaxies 
with MIPS 24 /xm detection are excluded); the corresponding 
value for the massive, star-forming sample is 3.1 kpc. 

The plot also shows a lack of extended (R e > 2 kpc) quies- 
cent galaxies, indicating that the massive quiescent (elliptical) 
galaxies observed in the Local Universe were not yet completely 
formed when the universe was <2 Gyr old. This could also be 
a surface brightness selection effect, although the width of the 
NMBS PSF compared to the CANDELS PSF causes all objects 
with R e < 5 kpc to be detected as point sources; the selection 
effect should then act only on the very extended objects. 

The plot in Figure 4 shows that MQGs were already present at 
z > 3, confirming previous results (see, e.g., Guo et al. 2012 and 
Marchesini et al. 2010, who first characterized the properties of 
a mass-complete sample of 3 < z < 4 galaxies, finding both 
quiescent and star-forming galaxies). For M * > 10 11 M q and 
sSFR <10 11 yr 1 we count four galaxies, corresponding to a 
completeness-corrected number density of n q = 5.2+A 6 ! x KT 6 
Mpc -3 (nq = 3.6^24 x 10 -6 Mpc -3 if the two galaxies with 
MIPS 24 /xm detection are excluded). The comoving volume 
was computed assuming a redshift range 2.8 < z < 4.0. 
The upper and lower error bars were computed following the 
recipe by Gehrels (1986); cosmic variance was finally added in 
quadrature. 
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Figure 6. Observed SED from NMBS (black points) together with EAZY best- fitting SED (solid red curve) of our sample of galaxies with log(M*M 0 ) > 1 1. A color 
cutout centered on the object and based on the F814W, F125W, and F160W filters from CANDELS, with color scheme following Lupton et al. (2004), is presented 
in the upper part of each plot, while a cutout from the F160W filter is shown at the bottom of each plot. The MIPS det. label indicates those objects with a 3cr MIPS 
detection (see the text for details). The angular size of each cutout is 37 1 x 377. 

(A color version of this figure is available in the online journal.) 


Figure 4 clearly shows that CMQGs were already present at 
z > 3. This is one of the main results of this work. Studies 
so far have found this population of galaxies up to z ~ 2 
(see, e.g., Kriek et al. 2006; Bezanson et al. 2009). Our data 
allow us to push back in time the appearance of CMQGs 
from when the universe was ~3 Gyr old to an age of less 
than 2 Gyr. 

As we will show in Section 4.3, these objects do not exist in 
significant enough numbers to explain the abundance of CMQGs 
at z ~ 2. 

3.2. Compact, Star-forming Galaxies at z ~ 3 

The existence of a relation between M * and R e (see, e.g., Shen 
et al. 2003; Mosleh et al. 201 1) implies that we cannot consider 


a single value for the R e when selecting compact galaxies at 
different stellar mass ranges. Assuming R e a M® 32 , valid for 
galaxies at 2.5 < z < 3.5 (Mosleh et al. 201 1), an R e = 2.0 kpc 
for a lo g(M*/M 0 ) =11 galaxy scales to R e = 1.4 kpc for 
1 og(M*/M 0 ) = 10.5 and R e = 1.0 kpc for 1 og(M*/M 0 ) = 
10.0. Figure 4 then shows that there are a number of com- 
pact star-forming galaxies spanning the whole range of stellar 
masses. In particular, and more interestingly, in our sample there 
are five galaxies with high sSFR (log(sSFR/yr _1 ) ~ —9), stel- 
lar mass 10.6 < lo g(M*/M©) < 11, and with R e < 1.4 kpc 
(i.e., they are compact). Their redshift is above z = 2.6 
with 68% confidence level, excluding contamination from low- 
redshift galaxies with broad probability distributions of photo- 
metric redshifts. From Figure 5, the stellar mass completeness 
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Figure 7. Observed SED from NMBS (black points), EAZY best-fitting SED (solid red curve), color cutout built from F814W, F125W, and F160W CANDELS 
frames, and F160W cutout for our sample of compact, star-forming galaxies with 10.6 < log M*/M 0 < 1 1.0. The angular size of each cutout is 377 x 377. 

(A color version of this figure is available in the online journal.) 


of log(M*/M 0 ) ~ 10.6 galaxies is lower than 50% for M/L > 
O.9M 0 /L Oi y, i.e., we could possibly be missing star-forming 
galaxies with significant dust absorption. In Figure 7, we present 
their SEDs along with F160W and color cutouts. We note, how- 
ever, that one out of the five objects has both 5 and 8 /xm excess 
that could be the imprint of a Type 2 AGN. However, there is 
no evidence for this AGN in the rest-frame optical SED and 
we therefore do not think that it is causing a significant bias 
in our sizes, although obscured AGNs are a potential source of 
uncertainty. 


To determine if there is a faint extended component in the 
compact star-forming galaxies and to better assess our size 
measurements, we directly stacked the images for the five 
objects. We normalized each image tile to its peak flux to prevent 
any single object from dominating the stack. The brightness 
profile of the stacked image was analyzed using GALFIT and a 
Sersic profile, obtaining R e = 0753 ± 0702 and n = 3.5 zb 0.2. 
Assuming an average redshift of z ~ 3, this corresponds to 
R e ~ 7.7 kpc. However, we note that visual inspection of the 
images of the individual objects shows that the aspect ratio and 
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Figure 8. Panels on the left show the stacked image for three out of the 
five compact star-forming galaxies with high stellar mass at z > 3, obtained 
after excluding the two galaxies with very different aspect ratio and position 
angle (top) and the residual image resulting from the fit (bottom). The panels 
on the right show on top the circularized radial profile (arbitrary units) 
of the stacked image indicated by the black points. The triple-dotted dashed 
red curve marks the PSF-convolved Sersic profile recovered by GALFIT, while 
the blue dashed line represents the profile of the PSF. The bottom panel shows 
the residuals measured directly from the GALFIT residual image (black points) 
and smoothed difference between the PSF-convolved analytic profile and the 
original input image (solid red line). The light profile closely resembles that of 
the PSF and it does not show evidence for a faint extended halo. 


(A color version of this figure is available in the online journal.) 




Figure 9. Examples of the SFHs resulting from the different evolution scenarios 
assumed in this work. Left panel: from top to bottom are the delayed- 
exponential SFH, the constant delayed-exponential SFH, and the truncated 
delayed-exponential SFH. Right panel: from top to bottom, the exponential 
and truncated exponential SFH (see Section 4.1 for descriptions of the SFHs 
used here). The adopted parameters are representative of the average z > 3 
population for all but the truncated exponential SFH: ^-folding time (r) of 
8 x 10 4 * * 7 yr and age of 3 x 10 8 * * yr. The age of the truncated exponential SFH 
corresponds to 2 x 10 7 yr. Arbitrary normalization and offset were applied in 
order to increase readability. 

(A color version of this figure is available in the online journal.) 


position angle of two galaxies are very different from the other 
three, causing significant broadening of the profile. We therefore 
repeated the stacking excluding these two sources, obtaining 
R e = Of! 12 =b 0?01 (0.9 kpc at z ~ 3) and n = 1 .7 ± 0.2, further 
supporting their compact configuration. The stacked image and 
its surface brightness profile for this second analysis are shown 
in Figure 8. 

These compact, star- forming galaxies with stellar mass 

10.6 < lo g(M*/M 0 ) < 11 are likely candidates for being 
the progenitors of the CMQGs observed at z — 2 , as recent 
works have shown (see, e.g., Patel et al. 2013; Barro et al. 
2013). However, as we will show in Section 4.1, the evolu- 
tion of the stellar mass of this sample of objects does not 
increase above log(M*/M©) = 11 by z = 2.3. Therefore, 
the net result is that the compact, star-forming galaxies with 

10.6 < lo g(M*/M 0 ) < 11 at z ~ 3 will not directly contribute 
to the number density of the more massive CMQGs at z ~ 2. 

4. WHAT WERE THE PROGENITORS OF THE z ~ 2.3 

COMPACT, MASSIVE, AND QUIESCENT GALAXIES? 

4.1. Evolving z > 3 Galaxies to z = 2.3 

In this section, we describe the evolution of the z > 3 
population of massive (M* > 10 11 M©) galaxies down to 
z = 2.3, paying specific attention to the progenitor population 

of the massive, quiescent (sSFR < 10“ 11 yr -1 ) galaxies at 
z ~ 2.3 (Kriek et al. 2006; van Dokkum et al. 2008; Bezanson 

et al. 2009). We used the stellar masses and SFRs from the 
public NMBS catalog. These were computed using the FAST 
code (Kriek et al. 2009), adopting a Kroupa (2001) initial mass 
function (IMF), a delayed-exponential star formation history 

(SFH) and solar metallicity. 

The stellar mass and the SFR of each z > 3 galaxy were 
evolved to z = 2.3 using the GAL AXE V program from the 


Bruzual & Chariot (2003) SSP models using a set of different 
SFHs. Specifically, we adopted five distinct SFHs: a constant 
delayed-exponential SFR (CSF), an exponentially declining 
SFR (E), an exponentially declining with quenching of star 
formation 100 Myr after the observed redshift SFR (TE), 
a delayed-exponential SFR (DE), and its quenched version 
100 Myr after the observed redshift (TDE). We allowed each 
SFH to progress starting at the mass and SFR given by FAST. 
The adopted SFHs are schematically presented in Figure 9. We 
note that the constant delayed-exponential SFH is the result of 
a delayed-exponential SFH, from the formation of the galaxy 
until the epoch of observation at z > 3, and of a pure constant 
SFH, for the following evolution. 

The z > 3 sample and its evolution at z = 2.3 are shown in 
the sSFR -M* plane in the four panels of Figure 10 (we omit the 
truncated delayed-exponential case as its results resemble those 
from the truncated exponential SFH). 

The panels in Figure 10 show that galaxies with sSFR < 
10 -9 yr -1 do not significantly increase their stellar mass with 
time due to their low SFRs, but instead they keep approximately 
the same value, or slightly decrease it, due to the return of stellar 
mass to the gas phase in the ISM. This is valid also for the 
sample of compact, star- forming (log(sSFR) ~ — 9) galaxies 
with stellar masses 10.6 < lo g(M*/M©) <11 presented in 
the previous section. Specifically, there is only one galaxy at 
z ~ 3 with 10.6 < lo g(M*/M 0 ) < 11 that will evolve to 
M* > 10 11 Mq at z = 2.3. This mass evolution occurs for 
two out of the five chosen SFHs (exponentially declining and 
constant delayed exponential), but only for the exponential SFH 
is the sSFR < 10“ 11 yr -1 at z = 2.3. Therefore, galaxies with 

10.6 < 1 og(M*/M 0 ) < 11 at z ~ 3 will not enter the CMQ 
population at z ~ 2 in great numbers, unless they grow their 
stellar masses through merging and/or further quenching of 
their SFRs. Overall, the largest contribution to the change in 
number density in the evolution of the population of MQGs 
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Figure 10. Evolution of the z > 3 galaxy sample in the sSFR-M* plane to z = 2.3 for four different SFHs (left to right, top to bottom: constant delayed-exponential, 
exponential, delayed-exponential, and truncated exponential SFHs). The large red squares mark compact (i.e., R e < 2 kpc) z > 3 galaxies, while the blue smaller 
squares indicate the remaining z > 3 galaxies. The position of each galaxy after evolution is indicated by the green triangles and yellow circles for descendants of 
compact and extended galaxies, respectively. The filled area marks the selection criteria adopted for the massive (Af* > 10 11 Af 0 ) and quiescent (sSFR < 10 -11 yr _1 ) 
sample at z = 2.3. The vertical dashed line indicates the stellar mass limit corresponding to our 70% completeness. Barring significant size evolution, the descendants 
of the red squares that fall within the shaded box indicate the number of predicted z ~ 2 compact, massive, quiescent galaxies. The number of galaxies in the shaded 
box is possibly a lower limit as they are the result of evolving the full z > 3 sample and not only those galaxies from the stellar mass complete sample. In particular, 
there may be galaxies with stellar masses below our completeness limit at z ~ 3 which could nonetheless grow above the limit by z ~ 2.3. The largest contribution to 
the change in number density in the evolution of the population of massive quiescent galaxies comes mainly from the decreased value of the SFR at z = 2.3, rather 
than from an increase in stellar mass with cosmic time. 

(A color version of this figure is available in the online journal.) 


comes mainly from the decreased value of the SFR at z = 2.3, 
rather than from an increase in stellar mass with cosmic time. 

In the following section, we present the number densities of 
MQGs; the number densities of the compact, massive, quiescent 
population are discussed in Section 4.3. As a further and more 
consistent source of comparison, the number density of z > 3 
massive galaxies and of z = 2.3 MQGs was also computed using 
the full NMBS catalog. These two populations were selected 
according to the same limits in stellar mass and SFR used for the 
other samples, i.e., log(M*/M 0 ) >11 and log(sSFR/yr _1 ) < 
— 11. The full list of the number densities computed for the 
different choices of SFH can be found in Table 1 . The comoving 
volume was computed assuming a redshift range 2. 8 < z < 4.0. 
We used upper and lower error bars from Gehrels (1986) 
and added in quadrature the cosmic variance. Figure 3 shows 
that there are six objects with z < 2.8, although all of them 
have stellar masses M* < 10 11 M©. We repeated our analysis 
excluding the six objects with z < 2.8 and obtained results 
quantitatively similar to those presented here. 


4.2. The Evolution of Massive z > 3 Galaxies to z ~ 2 

In order to explore how our results depend on different SFH in 
SED modeling, we adopt a different SFH for the past and future 
history of galaxies in each panel of Figure 10. Specifically, for 
the exponential and truncated exponential SFH, we used stellar 
masses and SFRs computed adopting an exponential SFR and 
a Salpeter (1955) IMF, and converting the SFRs and stellar 
masses to a Kroupa IMF by subtracting 0.2 dex. The location of 
the starting galaxies does not depend much on the SFH used to 
compute the stellar masses and SFRs, meaning that the choice 
of SFH primarily affects the evolved galaxies. 

The number density n M of massive (M* > 10 11 M 0 ) z > 3 
galaxies from our sample is hm = 14.9* 7 6 9 8 x 10 -6 Mpc -3 ; 
Figure 11 shows that it is fully compatible with the number 
density of z ~ 2. 2-2. 5 quiescent galaxies from Brammer et al. 
(201 1). In particular, this implies that, in the absence of external 
processes able to trigger new star formation or increase the 
stellar mass (e.g., galaxy mergers), the whole population of 
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Figure 11. Left panel: number density of the massive z > 3 sample with overlap with the CANDELS field (blue open circle), and computed from the full NMBS 
catalog (red open circle). The filled red square marks the number density of z ~ 2.3 massive quiescent galaxies computed using the full NMBS catalog. We note 
that, while the Brammer et al. (2011) number density refers to quiescent galaxies selected via the UVJ diagram, our measurement of the z ~ 2.3 massive quiescent 
population is obtained applying an sSFR < 10“ 11 yr -1 cut. The blue hashed box at z ~ 2.3 encompasses the range in number density of massive quiescent 
galaxies as predicted by four of our SFH scenarios (DE, TDE, E, and TE) for evolving all galaxies from z > 3 to lower redshift. The vertical extent of the bar also 
takes into account the expected cosmic variance. It is clear from this that there are enough massive galaxies at z > 3 to account for the full massive quiescent galaxy 
population at z ~ 2. The blue filled circle at z = 2.3 represents the number density and total error from the CSF model. The total number density of log(M*/ Af 0 ) >11 
galaxies at z > 3 is consistent with the population of massive quiescent galaxies at z ~ 2; all except the CSFH are plausible SFHs in reproducing the evolution to 
z ~ 2. Right panel: number density of the original massive z > 3 sample (open gray and magenta circles for the measurement obtained using our sample and the full 
NMBS catalog, respectively) and of the compact massive z > 3 sample (thick blue plus sign), which encompasses the same objects as the z > 3 MQ sample, and hence 
has the same number density. The colored boxes at z ^ 2.3 mark the number densities of evolved compact, massive, quiescent galaxies under different scenarios. 
The blue box represents pure evolution in the SFHs with no merging. The red box and points represent the evolution after galaxies have been allowed to merge once 
randomly. The green box includes the effect of the SFH, a random merger, and a post-merger truncation of the SFH. The blue and green bars and the associated points 
have been shifted by an arbitrary value in z to improve readability and should be considered at z = 2.3. The Brammer et al. (201 1) data are plotted only for z > 2, i.e., 
where the vast majority of massive quiescent galaxies are also compact. Also plotted is the number density of 2.2 < z < 2.4 massive, quiescent galaxies from the full 
NMBS catalog (red filled square). The number density of the compact, massive, quiescent population of galaxies obtained with pure SFH evolution to z = 2.3 is only 
marginally consistent with previous determinations; a simple merging model produces number densities in better agreement with previous determinations. Agreement 
at the lor level is found with the number density of z ~ 2 massive, quiescent galaxies computed from the full NMBS data set for the simple evolution scenario; the 
simple merging model increases the level of agreement of the number densities. We caution, however, that the number densities of the evolved samples are perhaps 
lower limits since they are the result of evolving all galaxies at z > 3 and not just those for which we are mass complete. Specifically, there may be galaxies below 
our completeness limit at z ~ 3 which could nonetheless grow above the limit by z ~ 2.3. 

(A color version of this figure is available in the online journal.) 


Table 1 

Number Densities for the Galaxy Populations and SFHs Adopted in This Work 


Redshift 

Population* 1 



Number density b (10 6 

Mpc -3 ) 


z > 3 

M 



1+9^ 




M (Full NMBS) 



24 - 3 - 3 '.3 




MQ 



c 9+4.6 
D - Z -3.1 



Redshift 

Population 



Number density (10 -6 

Mpc -3 ) 


z = 2.3 

MQ (Full NMBS) 



14 7+12-1 
A4 * / -10.5 



Redshift 

Population 

E (10 -6 Mpc -3 ) 

TE (10 -6 Mpc -3 ) 

DE (10 -6 Mpc -3 ) 

TDE (10 -6 Mpc -3 ) 

CSF (10 -6 Mpc -3 ) 

z = 2.3 

MQ 

13 - 2 T 7 6.2 

1 4 0+7.9 
14 - y -6.8 

Q 0+6.2 
y - y -5.0 

14.6T 7 6 8 7 

3 6 +4 - 0 

3.0_24 


CMQ 

6 . 1 ^i 

c o+4.6 
— 3.1 

c 9+4.6 
D ‘ Z - 3.1 

6> 5 3.7 

3 6 +4 - 0 

3.0—24 

Merging 

z = 2.3 

CMQ 

6 7 +51 

9 8 +6 - 2 
4.9 

7 0 +5 - 2 
/,U — 3.8 

Q 1+5.9 
4.7 

0 2 +2 - 6 
U.Z— 02 

Merging+Quenching 

z = 2.3 

CMQ 

12 3 +7 - 0 
1Z ^- 5.9 

9 8 +6 - 2 

V.5— 49 

i c O+8.0 

10.3—7 0 

Q 1+5.9 
^•-4.7 

1 1 1 +6 - 6 
1 5.4 


Notes. 

a Symbols refer to: M, massive galaxies (log (M*/Af Q ) > 11); C, compact galaxies ( R e < 2 kpc); Q, quiescent galaxies (log(sSFR/yr _1 ) < —11); and combinations 
thereof. 

b The comoving volume for the z > 3 and z = 2.3 populations is computed adopting the redshift range 2.8 < z < 4.0. 

massive z > 3 galaxies needs to have its SFR quenched to The comparison of the number density of z > 3 massive 

log(sSFR/yr -1 ) < — llbyz ~ 2. We will explore how mergers galaxies with z = 2.3 MQGs, both computed using the full 

might influence the growth of stellar mass in Section 4.3.1. NMBS catalog, confirms the above result. 
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This is further supported by our SFH predictions. The blue 
bar in the left panel of Figure 1 1 represents the number density 
of galaxies which, evolved from their observed redshifts, are 
massive and quiescent by z = 2.3. The box takes into account 
the spread in values from four of the five adopted SFHs 
(excluding the CSF) and cosmic variance. The blue box is in 
good agreement with measurements of quiescent galaxies from 
Brammer et al. (2011) at z ~ 2-2.5. Overplotted are also the 
values from the individual SFHs. In general, apart from the 
constant delayed-exponential SFH case, it is not possible to 
discriminate among the SFHs, mainly due to the high cosmic 
variance errors (~0.35 relative error). As shown in the left panel 
of Figure 11, four out of five of the SFHs do equally well in 
reproducing the number counts of MQGs at z ~ 2.3. The only 
exception is the CSF that underpredicts the observed number 
density of MQGs at z ~ 2 by a factor of ~4. This suggests 
that the only valid SFHs are the declining or truncated SFHs 
and that the constant delayed-exponential SFH is not a valid 
option in this range of redshift. On the other side, given that the 
number densities of massive, star-forming galaxies at z ~ 2 is 
approximately the same as that of MQGs (Brammer et al. 201 1), 
the fact that the CSF SFH underpredicts the number densities 
of MQG means that the CSF SFH would also underpredict 
the number density for the massive star-forming galaxies at 
z ~ 2. Considered together with the low number of star-forming 
galaxies with CSF SFH that cross the lo g(M*/M 0 ) = 11 
threshold (see the top left panel of Figure 10), the above result 
suggests that high-redshift massive star- forming galaxies might 
be characterized by rising SFHs, in agreement with recent works 
(e.g., Maraston et al. 2010; Papovich et al. 2011). 

Interestingly, Bell et al. (2012) found a similar result when 
comparing z ~ 1 galaxies to the z = 0 quiescent population, 
with the local population of MQGs approximately as numerous 
as the entire massive population at z ~ 1. This could suggest 
that mechanisms for quenching, either the same over the history 
of the universe or of different nature at different epochs, are 
ongoing more or less continuously since z ~ 4. 

4.3. The Progenitors of z ~ 2 Compact, 

Massive, Quiescent Galaxies 

Several works have demonstrated that the vast majority of 
massive quiescent z ~ 2 galaxies are also compact (see Szomoru 
et al. 2012 and references therein). This allows us to directly 
compare our number densities for CMQGs to the number 
densities at z = 2-2.5 from Brammer et al. (2011) and Bezanson 
et al. (2009) and to the number density of MQGs at z = 2.3 we 
measured using the full NMBS sample. 

Bezanson et al. (2009) published an estimate of the number 
density of CMQGs, based on the stellar mass function of 
Marchesini et al. (2009). The corresponding value is marked 
in Figure 11 by the crossed-circle symbol. We increased the 
original error bar to take into account the effects of cosmic 
variance using the Moster et al. (2011) recipe and adding it in 
quadrature. The final error bar is a factor of ~ 1.5 the error bar 
quoted in the original work, but more representative of the true 
uncertainties. 

As is seen in the right panel of Figure 1 1 , the number density 
nc of massive galaxies that are compact at z > 3 (specifi- 
cally, the number density of compact galaxies at z > 3 with 
lo g(M*/M 0 ) > 11 and log(sSFR/yr -1 ) < —11, which also 
corresponds to the number density nQ of MQGs at z > 3) is 
a factor of ~3 smaller compared to the number density of all 
massive galaxies. Nonetheless, nc is consistent within lcr with 


the number density of MQGs computed from the NMBS. This 
agreement is mostly the consequence of the large error bars, as 
the number density of z > 3 CMQ galaxies is systematically 
smaller than the number densities of MQ galaxies we adopted 
as reference at z ~ 2. The evolution to z = 2.3 of the compact, 
massive galaxies using our chosen SFHs (blue box) produces a 
number density which is approximately a factor of >4 smaller 
than the number density of MQGs from Brammer et al. (2011) 
and consistent only at a 2 a level. We can explain the decrease in 
number density between z ~ 3 and z ~ 2 in the context of our 
evolution models: the increase with cosmic time of the fraction 
of mass returned to the ISM decreases the stellar mass. One pos- 
sible source of the discrepancy between our measured number 
density and that of Brammer et al. (201 1) could be the different 
criteria adopted by Brammer et al. (201 1) to select the quiescent 
population (the UVJ color selection; Williams et al. 2009). In 
a recent work, Szomoru et al. (2012) showed that the fraction 
of quiescent galaxies selected at z > 2 using the UVJ method 
is equivalent to an sSFR < 10“ 10 yr -1 criterion. 7 It is worth 
noting that we do not find z > 3 compact, massive, star-forming 
galaxies, and specifically compact, massive galaxies with 
-11 < log(sSFR/yr 1 ) < —10 which, following our evolu- 
tionary models, would eventually become compact, massive, 
quiescent by z ~ 2. This means that our measurement of the 
number density of CMQGs at z ~ 2.3 does not change if we 
use an sSFR that is compatible with that for a UVJ cut. 

When we compare the expected number density of CMQGs 
that have been evolved from z ~ 3 with the measured sample of 
z = 2.3 MQGs from NMBS (nearly all of which are compact), 
we find that the two agree to within la. 

The plot also shows that the number density from Bezanson 
et al. (2009) is higher than our measurements, and compatible 
only with the upper end of Brammer et al. (201 1). One possible 
reason for this discrepancy could be that Bezanson et al. (2009) 
assume the fraction of MQGs to be 0.5, which is likely a too 
optimistic choice (e.g., Dominguez Sanchez et al. 2011). 

Similar to the left panel, in the right panel of Figure 1 1 the 
number densities from the individual SFHs are marked. For the 
compact massive galaxies, the number density from the CSFH 
is compatible with the measurements from the other SFHs, in 
agreement with recent works (see, e.g., Gonzalez et al. 2010; 
Reddy et al. 2012). 

4.3.1. The effects of mergers 

In the previous sections, we showed that the assumed evolu- 
tion models allow us to reproduce within lcr the number density 
of (compact) MQGs at z ~ 2. Despite being consistent within 
the errors, the pure-evolution measurements are still systemati- 
cally below the number density of MQGs at z = 2.3, indicating 
that pure evolution might just be barely enough. Merging and 
quenching after merging are also plausible processes in this 
range of redshift (see, e.g., Khochfar & Silk 2006; Cameron & 
Pettitt 2012). In order to try to understand the possible mecha- 
nisms that could be responsible for the buildup of the compact, 
massive, quiescent population at z ~ 2, we then implemented 
a very simple statistical model for how mergers could affect 
the evolution in the observed number densities. Our toy model 
consists of randomly choosing pairs of galaxies among the 110 


7 We also selected quiescent galaxies in the full NMBS data set using the 
UVJ color-color technique. The sSFRs for quiescent galaxies at 2 < z < 2.5 
and with log (M*/M 0 ) > 11 are lower than log(sSFR/yr _1 ) = —9.84, with a 
75% upper limit log(sSFR/yr _1 ) = —10.3 and a median log(sSFR/yr _1 ) = 
- 11 . 11 . 
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galaxies with z > 3 and with a measurement of the R e and 
leaving the SFH of each galaxy to evolve independently of its 
companion. The stellar mass of the merged pair was finally con- 
sidered as the algebraic sum of the two components; the sSFR 
was computed as the mass-weighted sum of each component 
(or, equivalently, as the sum of the two SFRs divided by the 
total stellar mass), while we assumed for the R e of the merged 
pair the R e of the more massive companion. 

The effect of merging on the number density evolution is 
represented in the right panel of Figure 1 1 by the red box; the 
result is an agreement between the observed number density of 
compact MQGs at z ~ 2 and that from our models, for all the 
SFH but the CSF. In the CSF case, in fact, the number density is 
even smaller than the number density of the population without 
merging. In the framework of our toy model, this is due to the 
high probability that each pair contains a galaxy with high SFR. 
The end product is then biased toward high sSFR galaxies at the 
end of the merging process. 

Given the potential effect of a starburst or an AGN that is 
triggered by the merger, we also examined a scenario in which 
the SF is quenched following the merger. This is supported by 
recent hydrodynamical simulations in which mergers play an 
important role in the gravitational heating of the halo gas, and 
consequently in the suppression of star formation (Johansson 
et al. 2009). However, we note that this last process does not 
improve the agreement significantly, except for the CSF case, 
for which the number density falls well within the values from 
the other SFH. 

Good agreement between the expected and observed num- 
ber densities for our evolved, merged, and quenched model 
is also found when we compare the expected number densi- 
ties to those calculated from the full NMBS sample using a 
log(sSFR/yr -1 ) < —11 cut. This comparison is useful as the 
full NMBS catalog has a higher precision than our sample that 
is limited to the area with CANDELS overlap. Although the 
merging model appears to be in better agreement with the data, 
it is clear that even a simple evolution model with no merging 
provides an adequate match to the data. Therefore, merging and 
quenching, which are both likely processes that are occurring in 
the galaxy population, are not required to explain the number 
densities. 

If the two quiescent galaxies with MIPS detections indeed 
have the extreme SFRs implied by the rest- frame 5.6 /xm fluxes, 
our results on the potential importance of merging would remain 
qualitatively unchanged. This is because these two galaxies are 
only 13%-20% of the whole sample of galaxies that would be 
classified as massive at z ~ 2.3. If the truncated post-merger 
SFH is correct then the results will also remain quantitatively 
similar as the prompt quenching of star formation assumed in 
this model would turn them into MQGs by z ~ 2.3. 

5. DISCUSSION 

According to current semi-analytic models for the formation 
of MQGs, the bulk of the stellar mass was formed as a compact, 
massive spheroid at z > 3 through gas-rich merger events (e.g., 
Oser et al. 2010). Specifically, in the redshift range between 
z = 5 and z = 3, the models predict that the central galaxy 
would still be building up from gas flows which would feed the 
formation of stars in the central region of the galaxy directly, 
forming the concentrated stellar system (Naab et al. 2009). In a 
second stage, quenching mechanisms such as major merger or 
feedback from AGNs or from star formation would convert 
the full population of massive galaxies into the population 


of quiescent massive galaxies observed at later cosmic times. 
Subsequent minor dry mergers would be responsible for the 
increase in size (Naab et al. 2009; Oser et al. 2012), while 
keeping the gain in stellar mass to a factor of <2 (van Dokkum 
et al. 2010). 

Understanding what are the physical mechanisms of star 
formation quenching, when they started to act, and how long they 
took to completely quench star formation is then a central key in 
our knowledge of formation and evolution of local MQGs. While 
our analysis does not provide us with significant information on 
the physical mechanisms, it allows us to put new constraints on 
both when star formation quenching could have happened and 
on how fast it could be. 

In fact, the compatibility of the number density of massive 
z > 3 galaxies with the number density of z ~ 2 MQGs from the 
literature suggests that the population of MQGs at z ~ 2 can be 
completely accounted for by the observed z > 3 population of 
massive galaxies that is subsequently quenched. Specifically, 
when evolving our sample of z > 3 galaxies to z = 2.3, 
a good agreement is obtained introducing quenching of star 
formation after merging, a process which is supported by recent 
hydrodynamical simulations. 

Second, our observations of CMQGs at 3 < z < 4 push 
back in time the appearance of this class of objects to when the 
universe was ~2 Gyr old. 

Finally, considering that, according to models, massive galax- 
ies should still be actively forming stars at 3 < z < 5, then the 
observation of compact massive galaxies that are already quies- 
cent by 3 < z < 4 imposes that the quenching of star formation 
should be a rapid mechanism in massive galaxies, acting on 
timescales of less than 1 Gyr in the early universe, in agreement 
with recent determinations (e.g., Barro et al. 2013). 

6. CONCLUSIONS 

In this work, we used two overlapping public sets of data 
in a region of the COSMOS field to identify the progenitors 
at z > 3 of the CMQGs observed at z ~ 2. Stellar masses, 
sSFRs, and photometric redshifts were taken from the NMBS. 
The sizes were measured on high-resolution CANDELS F160W 
images. The population of z > 3 galaxies was evolved to 
z = 2.3 using the Bruzual & Chariot (2003) models following 
five different SFHs (constant delayed exponential, delayed 
exponential, truncated delayed exponential, exponential, and 
truncated exponential). All the number densities were computed 
assuming a comoving volume corresponding to 2.8 < z < 4.0. 
Our main results can be summarized by the following points. 

1. We discovered four compact, massive (M* > 10 11 M 0 ) 
quiescent (SED-based sSFR < 10“ 11 yr -1 ) galaxies at 
z > 3, corresponding to a completeness-corrected number 
density of nq = 5.2^^ x 10 -6 Mpc -3 . If the two 
galaxies with MIPS detection were excluded, the resulting 
completeness-corrected number density would be nQ = 
3.6+ 4 2 ° 4 x 10“ 6 Mpc” 3 . 

2. For a complete sample of 10 galaxies with lo gM*/M 0 > 
11, we found that the quiescent (sSFR < 10" 11 yr- 1 ) 
galaxies are compact ( R e ~ 1.2 kpc), while the star- 
forming galaxies are extended ( R e ~ 3. 1 kpc), qualitatively 
similar to what is found at lower redshifts (z ~ 2.3). If the 
two quiescent galaxies with MIPS 24 /xm detection were 
considered star forming, the averaged sizes of the quiescent 
and star- forming galaxies would be R e = 0.6 kpc and 
R e = 2.8 kpc, respectively. 
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3. We found five compact ( R e < 1.4 kpc), star-forming 
(sSFR ~ 10 -9 yr _1 ) galaxies at z ~ 3 with stellar masses 
10 10 6 < log(M*/M 0 ) < 10 1L0 . The small effective radius 
was confirmed by a Sersic (1968) profile fitting of the 
stacked image. 

4. The number density of massive z > 3 galaxies is com- 
parable to the number density of z ~ 2 MQGs from the 
literature. The evolution of the number density of the z > 3 
galaxy population to z ~ 2 can be accounted for with a fam- 
ily of decaying or truncated SFHs. The CSF SFH does not 
fit the observed number densities. A model with quenching 
of the SFR between z = 2.3 and 3 does. 

5. When we evolve our compact z > 3 galaxies to z = 2.3, 
we find that the predicted number density of CMQGs is 
consistent at the 1 o level for all of our adopted SFHs. An 
even better agreement is obtained if we quench the star 
formation after the merging event. Such a rapid truncation 
is supported by recent simulations (e.g., Johansson et al. 
2009). 

The above results lead to the following conclusions. 

1 . The number density of MQGs at z ~ 2 can be completely 
accounted for by the observed z > 3 population of 
massive galaxies that is subsequently quenched. Similarly, 
the number density of CMQGs at z ~ 2 is consistent with 
the number density of compact, massive galaxies at z > 3, 
implying that most CMQGs at z > 3 must be quenched by 
z ~ 2. Despite the good agreement within the large error 
bars, the number density of z > 3 CMQGs is systematically 
lower than the number density of MQGs at z ~ 2, most of 
which are compact. This implies the potential need for an 
additional channel for the creation of CMQGs at z ~ 2, 
which may include galaxy merging. 

2. The existence of CMQGs at 3 < z < 4 pushes back in time 
the appearance of this class of objects to when the universe 
was ~2 Gyr old. 

3. Since galaxy formation models predict that at 3 < z < 5 
galaxies should still be building up from gas flows, the 
existence of compact, massive galaxies that are quiescent 
already at 3 < z < 4 implies that the quenching of 
star formation should be a rapid mechanism, acting on 
timescales of less than 1 Gyr in the early universe. 

We note that half of the subsample of massive and quiescent 
galaxies is detected in MIPS, implying large SFRs if the MIR 
emission were associated with dust-enshrouded star formation. 
Since the MIPS band probes rest- frame wavelengths shorter than 
6 /jL m at z > 3 (i.e., emission from hot dust), SFRs derived from 
MIPS are very uncertain and can potentially be contaminated 
by emission from a dusty torus of an AGN. Observations in the 
far-IR (e.g., ALMA) are needed to robustly quantify the level of 
obscured star formation and to confirm the quiescent nature of 
these galaxies. 

The analysis presented in this work is based on a sample size 
of about one hundred objects, resulting from the intersection of 
two catalogs, CANDELS and NMBS on the single COSMOS 
field. Recent projects like the 3D-HST survey (van Dokkum 
et al. 2011; Brammer et al. 2012) will provide accurate redshift 
for ~7000 objects at 1 < z < 3.5 over a total area a factor of 
larger than the one available for this work. This will reduce by a 
large amount both the Poisson noise and the uncertainties due to 
the cosmic variance. Additionally, the increase in photometric 
depth from both CANDELS and the 3D-HST with respect to 
the currently available data sets will also allow us to probe 


the population of galaxies down to smaller stellar masses. 
These improvements will contribute significantly in the near 
future to further understanding the buildup and evolution of 
massive galaxies. 
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